Prediction of DNA-Binding Propensity of Proteins by the Ball-Histogram Method
Abstract
We contribute a novel ball-histogram approach to predicting the DNA-binding propensity of proteins. Unlike state-of-the-art methods, which construct an ad hoc set of features describing the charged patches of a protein, the ball-histogram technique enables a systematic Monte Carlo exploration of the spatial distribution of charged amino acids, capturing the joint probabilities of specified amino acids occurring at certain distances from each other. This exploration yields a model for predicting DNA-binding propensity. We validate our method in prediction experiments, achieving favorable accuracies. Moreover, our method provides interpretable features involving the spatial distributions of selected amino acids.
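The abstract does not spell out the algorithm, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes that balls of a fixed radius are centred on residues chosen uniformly at random and that a feature is the relative frequency of each tuple of per-type residue counts inside the ball. The function name `ball_histogram`, its parameters, and the toy coordinates are all hypothetical.

```python
import math
import random
from collections import Counter


def ball_histogram(residues, types, radius, n_samples=2000, seed=0):
    """Estimate a ball histogram by Monte Carlo sampling.

    residues: list of (amino_acid_type, (x, y, z)) pairs.
    types:    tuple of amino acid types whose joint counts are recorded.
    Returns a dict mapping count tuples to relative frequencies.
    """
    rng = random.Random(seed)
    counts = Counter()
    for _ in range(n_samples):
        # Assumption: each ball is centred on a uniformly chosen residue.
        _, centre = rng.choice(residues)
        # Count, per selected type, the residues lying inside the ball.
        key = tuple(
            sum(1 for t, pos in residues
                if t == wanted and math.dist(centre, pos) <= radius)
            for wanted in types
        )
        counts[key] += 1
    return {key: n / n_samples for key, n in counts.items()}


# Toy structure (made-up coordinates): two nearby positive residues
# and one distant negative residue.
toy = [("ARG", (0.0, 0.0, 0.0)),
       ("LYS", (3.0, 0.0, 0.0)),
       ("GLU", (20.0, 0.0, 0.0))]
hist = ball_histogram(toy, types=("ARG", "LYS"), radius=5.0)
```

Each entry of `hist` estimates the probability that a sampled ball contains a given combination of arginine and lysine counts. Under the assumptions above, such frequencies could serve as input features for any standard classifier.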
Similar resources
Searching for Important Amino Acids in DNA-binding Proteins for Histogram Methods
We develop a method capable of identifying important amino acids for histogram-based methods that predict DNA-binding propensity. This method can be used both for prediction from sequence information (Tube Histograms) and for prediction from structural information (Ball Histograms). We validate our method in prediction experiments using only the proteins' primary structure, achieving favourable accuracies. ...
In silico investigation of lactoferrin protein characterizations for the prediction of anti-microbial properties
Lactoferrin (Lf) is an iron-binding multi-functional glycoprotein which has numerous physiological functions such as iron transportation, anti-microbial activity and immune response. In this study, different in silico approaches were exploited to investigate Lf protein properties in a number of mammalian species. Results showed that the iron-binding site, DNA and RNA-binding sites, signal pepti...
Biochemical characterization of PE_PGRS61 family protein of Mycobacterium tuberculosis H37Rv reveals the binding ability to fibronectin
Objective(s): The periodic binding of protein expressed by Mycobacterium tuberculosis H37Rv with the host cell receptor molecules i.e. fibronectin (Fn) is gaining significance because of its adhesive properties. The genome sequencing of M. tuberculosis H37Rv revealed that the proline-glutamic (PE) proteins contain polymorphic GC-rich repetitive sequences (PGRS) which have clinical importance i...
SPECTROSCOPIC EVALUATION OF THE INTERACTION OF A TETRAZOLE DERIVATIVE SYNTHESIZED BY SEMI-GREEN METHOD WITH CALF THYMUS DNA AND BOVINE SERUM PROTEIN
Background & Aims: In recent decades, the application of tetrazole structures in various fields of medicine and industry has become very important, because they can cause structural and thus functional changes in the proteins. In this article, the effect of a new tetrazole derivative on calf thymus DNA (Ct-DNA) as well as on bovine serum albumin protein (BSA) in the solution was determined usin...
Rapid purification of HU protein from Halobacillus karajensis
The histone-like protein HU is the most-abundant DNA-binding protein in bacteria. The HU protein non-specifically binds and bends DNA as a hetero- or homodimer, and can participate in DNA supercoiling and DNA condensation. It also takes part in DNA functions such as replication, recombination, and repair. HU does not recognize any specific sequences but shows a certain degree of specificity to ...
Publication date: 2011